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Information  Capacity  of  the  Stationary  Gaussian  Channel 


I.  INTRODUCTION 

Information  capacity  of  the  continuous-time  stationary  Gaussian  channel 
(per  unit  time,  as  transmission  time  T  -»  00 )  has  been  a  problem  of  much 
interest  since  the  early  days  of  information  theory.  Results  date  back  to 
Shannon’s  1949  paper  [12],  which  contains  an  analysis  of  the  band- limited 
channel.  The  treatment  given  in  Gal  lager’s  1968  book  [6]  is  commonly 
referenced  as  "the"  solution  to  the  capacity  problem.  In  fact,  however,  the 
class  of  channels  that  fit  the  model  of  [6]  is  quite  limited,  due  to  the 
nature  of  the  constraints  that  are  applied.  These  constraints  eliminate  from 
consideration  a  very  large  class  of  channels  that  can  be  important  for 
applications. 

The  present  paper  contains  an  analysis  of  the  capacity  problem  that 
applies  to  those  channels  not  fitting  the  model  used  in  Gallager’s  book  [6]. 
The  development  is  also  quite  different  from  that  of  [6];  it  is  based  on  the 
results  of  [3].  It  will  be  seen  that  the  treatment  given  here  provides  a 
desirable  generality  on  the  class  of  transmitted  signals  which  is  not  present 
in  previous  treatments. 

The  model  of  the  SGC  (stationary  Gaussian  channel)  is  given  by 

Y(  t)  s  X(  t)  +  Z(  t)  ,  t  €  IR  =(-»,»),  (1) 

where  X  =  (X(t)},  Y  =  (Y(t)}  and  Z  =  (Z(t)}  represent  the  channel  input, 
output,  and  noise,  respectively,  Z  is  a  stationary  Gaussian  process,  and  X  is 
a  stochastic  process  independent  of  Z.  All  stochastic  processes  mentioned  in 
this  paper  will  be  assumed  to  be  measurable  and  to  have  zero  mean. 
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Shannon  [12]  (see  also  Fano  [5,  Chap.  5]  and  Pinsker  [10])  studied  the 


capacity  per  second  of  the  SGC,  defined  by 


C{V)  =  sup  lim  1  It(X.Y). 

xe/  T-*° 

T  T 

where  Lj,(X.Y)  =  I(Xq,Yq)  denotes  the  mutual  information  between  the  input 

T  T 

Xq  =  (X(t):  0  £  t  <  T}  and  the  corresponding  output  Yq,  and  the  supremum  is 

taken  for  all  admissible  inputs  X  =  (X(t)}  €  V.  The  class  Sf  of  admissible 

inputs  is  given  follows,  fiiwn  »  stationary  Gaussian  proces0  w 

with  a  spectral  density  function  (SDF)  f^  and  a  constant  P  >  0,  an  admissible 

input  is  a  wide-sense  stationary  process  X  having  a  SDF  f^  satisfying 


fx(X) 

yxj 


dX  i  2wP. 


Formulas  for  C(y)  have  been  given  for  special  cases  ([5] . [10] , [ 12]) .  However. 

it  is  known  that  there  are  some  mathematical  difficulties  to  derive  the 

formulas.  In  this  paper  we  shall  derive  the  capacity  (see  (16)  and  (22))  in  a 

rigorous  way,  assuming  appropriate  conditions  on  the  processes  W  and  Z.  These 

assumptions  imply,  in  particular,  that  the  reproducing  kernel  Hilbert  space 

(RKHS)  of  the  process  W  is  the  same  as  the  RKHS  of  Z. 

In  some  previous  works  ([10], [12];  e  stationarity  of  inputs  is 

assumed.  To  remove  the  assumption  of  stationarity,  we  introduce  a  class  &  of 

admissible  inputs  which  we  show  to  contain  Sf  under  rather  mild  assumptions. 

Denote  by  IMI^  the  norm  of  the  RKHS  H^  ^  corresponding  to  the  process 
T 

WQ  =  (W(t):  0  i  t  <  T).  The  class  31  is  the  set  of  all  processes  X  satisfying 

Hm  i  E[IIxJhJ  T]  i  P.  (2) 

Th» 


Under  modest  assumptions,  it  will  be  shown  that  the  capacity  C(9 1) ,  defined  in 
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the  same  manner  as  C(y’).  is  equal  to  C(y).  This  means  that,  even  if  we  do  not 
require  the  stationarity  of  inputs,  the  capacity  is  attained  in  the  class  of 
stationary  inputs. 

A  constraint  similar  to  (2)  has  been  considered  by  Baker  [2], [3].  He 
determined  the  capacity 

CT  =  sup  Jr  It(X.Y). 

T  1 

Y* 

0 

T 

for  a  fixed  time  T.  where  the  supremum  is  taken  for  all  processes  Xq 
satisfying 

E[llxJllJiT]  $  P.  (3) 

T 

This  result  for  C  is  essential  for  the  derivation  of  our  main  results. 

The  average  power  constraints  on  the  input  studied  in  some  previous  works 
are  given  in  terms  of  the  noise  process  Z.  The  constraints  (2)  and  (3)  are 
stated  in  terms  of  a  process  W  which  is  different  from  Z.  The  SGC  subject  to 
(3)  is  called  a  mismatched  channel  ([2], £3]).  On  the  other  hand,  the  SGC  is 
called  a  matched  channel  if  W  =  Z.  The  capacity  of  the  matched  Gaussian 
channel  for  a  finite  time  interval  is  known  to  be  equal  to  £  P  ([1],  [7])  and 
is  not  increased  by  feedback  for  a  large  class  of  channels  [7]. 

In  [6,  Chap.  8],  Gal  lager  gives  a  comprehensive  treatment  of  this 
problem,  which  he  attributes  to  Holsinger.  As  mentioned  above,  his  setup  and 
assumptions  are  different.  Relations  between  the  Holsinger-Cal lager  result  and 
those  obtained  here  will  be  discussed  following  the  statement  of  our  main 
results.  It  will  be  shown  that  (when  f^  and  f^  are  rational)  the  assumptions 
of  [6]  are  much  more  restrictive  than  those  given  here. 

The  precise  definitions  of  the  capacities  are  given  in  Section  II.  In 
Section  III  we  give  the  statements  of  our  main  results,  with  some  necessary 
lemmas  given  in  Section  V,  and  proofs  in  Section  VI.  Section  IV  contains  a 
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discussion  of  these  results,  and  comparisons  to  previous  work. 

When  both  f^  and  are  rational,  the  results  given  here  and  those  of  [6] 
provide  a  complete  solution  to  the  information  capacity  problem. 


II.  PRELIMINARIES 


A  basic  assumption  to  be  used  throughout  this  paper  is  that  the 
constraint  is  given  in  terms  of  a  covariance  function  r^  defined  by  a  spectral 
density  function  f^.  Moreover,  the  Gaussian  noise  process  Z  is  assumed  to  be 
stationary  with  spectral  density  function  f^  and  associated  covariance 
function  r^.  The  integral  operator  in  LgCO.T]  with  kernel  r^  (resp. ,  r^)  is 
denoted  by  R^  j  (resp.,  R^  ^,) .  Since  r^  and  r^  are  both  continuous,  the 
reproducing  kernel  Hilbert  space  (RKHS)  defined  by  each  covariance  function 
for  the  parameter  set  [0,T]  is  isometric  to  the  range  of  the  (positive)  square 
root  of  the  associated  integral  operator  in  L2[0,T]  (see,  e.g. ,  [4a]);  we  use 
the  two  spaces  interchangeably,  noting  that  the  RKHS  is  a  space  of  functions, 
while  the  range  of  the  square  root  of  the  associated  integral  operator  is  a 
space  of  equivalence  classes.  It  should  be  noted,  however,  that  since  we  work 
with  probabilities  on  the  Borel  sets  of  LgCO.T],  letting  T  -+  »,  the  range  of 
the  square  root  operator  is  the  mathematical ly-correct  definition.  In  order 

2  i  ± 

to  have  a  finite  value  of  EliXII^  it  is  necessary  [1]  that  R^  ^.  =  R^  j. 

where  Lp  Is  a  trace-class  operator  with  eigenvalues  (t  (T),  n  >  1),  and  then 


EIIXII 


2 

W.T 


Trace  Lj.  =  2  t  (T) ' 

n£l  n 


(4) 


We  shall  also  assume  that  the  covariance  function  r^  of  the  stationary 
noise  process  Z  is  given  by  a  spectral  density  function  f^: 

rz(t.s)  4  EZ(t)Z(s)  =  J  e1X(t-s)fz(A)dX. 

w  .  OQ 
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In  this  paper  we  consider  the  following  classes  of  admissible  input 
signals . 

Definition  1  (Constraint  on  inputs): 

(1)  9tT  =  3tJ(?)  (T  >  C.  P  >  0)  is  the  set  of  all  xj  =  (X(t):  0  £  t  <  T} 

T 

such  that  the  sample  paths  of  Xq  belong  to  with  probability  one  and 

E[lixjllj  T]  ^  PT.  (5) 

(2)  *  =  9t^( P)  is  the  set  of  all  X  =  (X(t)}  such  that  xj  6  for  all  T  >  0 

with  probability  one  and 


lim  i  E[llxLl?  .]  (  P 
T-*»  **  u 


(6) 


(3)  if  =  5f^(Pi  Is  the  set  of  all  wide-sense-stationary  processes  X  =  {X(t)} 
with  SDF  f^  such  that  f ^  is  bounded  and 


r 


fx(X) 


dX  i  2wP. 


(7) 


Let  f  and  tj  be  random  variables  or  stochastic  processes  with  patns  in  an 
appropriate  space  and  denote  by  pc  and  p^  the  induced  probability 
distribution  of  f  and  the  joint  distribution  of  f  and  tj,  respectively.  The 
mutual  information  I(f,Tj)  between  f  and  tj  is  defined  by 


I(f 


4  |  io*  JOZl 


if  p^  is  absolutely  continuous  with  respect  to  the  product  measure  p^xp^, 

where  dp^/dp^xp^  is  the  Radon-Nikodym  derivative;  otherwise  I(f,r>)  is 

A  T  T 

infinite.  Denote  by  Lp(X,Y)  =  I(Xq.Yq)  the  mutual  information  between  the 
T  T 

input  Xq  and  the  corresponding  output  Yq  of  the  SGC,  and  define  the  mutual 
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information  I(X,Y)  per  second  by 


I(X.Y)  =  lim  Jr  I_(X.Y), 

T-«»  ‘ 

if  the  limit  exists.  In  order  to  be  consistent  with  past  work,  particularly 

[1]  -  [3],  the  induced  measures  are  defined  on  LgCO.T]  for  each  fixed  T. 

The  definition  of  capacity  used  in  this  paper  is  the  mutual  information 
version.  Thus,  the  capacity  of  the  SGC  is  the  supremum  of  I^.(X,Y)  or  I(X,Y) 
taken  over  all  admissible  inputs  X.  As  is  well-known,  the  information 
capacity  is  an  upper  bound  on  coding  capacity. 

Def ini tion  2  (Capacity  of  SGC)-' 

(1)  CT  =  cJ(P)  4  sup  (I  It(X.Y):  Xj  t  «J(P)}.  T  >  0. 

(2)  C  5  C^(P)  4  Tiin  cJ(P)  . 

T-*» 

(3)  C(S)  =  C^(P:*)  =  sup  {1^  i  IT(X.Y):  X  €  3^(P)}. 

T-*° 

(4)  C(y)  =  C^(P;if )  4  sup  (Ti^  i  IT(X.Y):  X  €  ^(P)}. 

7~*° 

T 

The  capacity  C  has  been  determined  by  Baker  ([2], [3])  for  a  general 
Gaussian  channel,  where  W  and  Z  are  not  necessarily  stationary. 

In  order  to  state  and  prove  our  main  results,  we  need  some  properties  of 
mutual  information. 

Lemma  1  (see  [!])•'  Suppose  that  the  input  X  of  SGC  (1)  is  a  Gaussian  process. 
Then  following  conditions  (a)  -  (c)  are  equivalent. 

(a)  IX(X.Y)  <  » 

T 

(b)  The  sample  paths  of  Xq  belong  to  j  with  probability  one. 

(c)  The  operator  R72TRY  rR72T  is  of  trace  class. 
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If  these  conditions  are  satisfied,  then  I^,(X.Y)  is  given  by 

00 

It(X.Y)  =i  2  log(l  +  rn(T)). 
n=l 


_x  _x 

where  t^(T) .  n  =  1.2,...  are  eigenvalues  of  \ 


In  the  case  where  the  input  X  is  also  a  stationary  Gaussian  process, 
there  have  been  various  works  on  the  mutual  information  (e.g., 

[5] , [9] , [ 1 1] , [ 12]) .  A  "well-known"  formula  of  the  mutual  information  per 
second  i s 

7(X-Y)  =kJllog(‘  (8) 

It  is  known  that  there  are  some  mathematical  difficulties  to  derive  (8) 

rigorously.  Pinsker  [11]  proved  (8)  under  some  assumptions.  Here,  for  later 

use,  we  restate  some  of  Pinsker ’s  results  related  to  (8).  We  note  by  F  the  set 

of  all  SDF's  f(X)  which  have  the  form  f  =  p(l  -  with  a  rational  SDF  p  and  a 

measurable  function  <#>  such  that  0  <>  y>(X)  <  1  for  all  X  €  1R  and 

J"_C0|log(l-<p(X))  |dX  <  ®.  A  stationary  Gaussian  process  is  said  to  be 

0  00 

information  regular  it  lim  i(X  w.Xp)  -  u.  lieoeasary  ai.d  sufficient  conditions 

Thbo 

for  information  regularity  are  given  in  [8],  For  example,  if  the  SDF  of  X  is 
rational,  then  X  is  information  regular. 


Lemma  2  [11  (Chap.  10),  9]:  Let  the  input  X  be  a  stationary  Gaussian  process 
with  SDF  f^(X).  Then 

(a)  i  It(X.Y)  >  ^  JjoJ.  - 

(b)  If  f^  €  F  or  fy  €  F,  then  (8)  is  true; 

(c)  If  X  or  Y  is  information  regular,  then  (8)  is  true. 
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For  later  use,  we  denote  by  A  the  class  of  all  pairs  (X.Z)  of  input  X  and 
noise  Z  for  which  (8)  holds. 

>v 

It  is  known  (e.g..  [1])  that  if  X  and  X  =  {X( t ) }  have  the  same  covariance 
function  and  X  is  Gaussian  then  I^.(X,Y)  £  I^.(X,Y),  where  Y  is  the  output 
corresponding  to  X.  Noting  (4)  we  know  that  the  constraints  (5)  and  (6) 
involve  only  covariance  functions.  Thus  in  order  to  find  the  capacity  of  the 
SGC  we  consider  WLOG  only  Gaussian  input  signals. 


We  now  consider  the  SGC  (1)  under  the  various  constraints  given  in  the 
preceding  section. 

Define  a  constant  8  by 


1  +  0 


= inf  ta:  0a  -  "ln(a-  yxT. 


dX 


(9) 


Roughly  speaking,  1  +  8  =  lim  fz(X)/f^(X)  if  f^(X)  and  fz(X)  are  smooth 

X-#» 

functions.  The  SDF  f^  of  the  noise  process  Z  can  be  written  in  the  form 

fz  =  (l+0)fw(l  +  a)  =  (l+0)fw(l  +  a+)(l  +  oj.  (10) 


where  cr+(X)  =  max(a(X),0)  and  ct_(X)  =  min(o(X),0).  We  assume  that  -1  <  0  <  ® 
and 


1  +  a_(X)  *  aQ,  X  €  R.  (11) 

where  a^  >0  is  a  constant. 

For  ease  of  notation,  o+  and  o_  will  be  used  in  the  proofs.  However,  in 
the  statements  of  our  main  results  a  will  be  replaced,  using  the  following 
equality: 


X  €  A 


=  0 


otherwise 
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where  A  =  {X:  fz(X)  <  ( 1+8) f^(X) } .  Note  that  (11)  is  equivalent  to 
(fz/fw)(X)  i 

II T .  MAIN  results 


We  introduce  auxiliary  stationary  Gaussian  processes  V  =  (V(t)}, 
U  =  (U(t)}  and  =  (U^fc^(t)}  (e  >  0)  as  follows.  The  process  V  is 
independent  of  U  and  U^fc'.  and  these  processes  have  SDF’s 


fv(X)  =  fw(X)[l  +  o+(X)]. 

(12) 

fjj(X)  =  -fw(X)[l  +  a+(X)]a_(X). 

(13) 

f  (e) (X)  =  -fw(X)(l  +  a+(X))a^£)(X). 

respectively,  where  a^£^(X)  =  a_(X)*l.-  _  -.(X)  for  X  €  IR 

function  1  j-_ ^  of  a  set  [-l.-e].  Note  that  f^  =  0(fy  - 

with  the  indicator 

fjj)  and  consequently 

R/  T  =  0(Rv>T  ~  ^u.T^ ' 

(14) 

If  the  formula  (8)  is  valid  for  all  Gaussian  inputs  X  €  then  the 
capacity  C(if)  would  be  equal  to 

r<p> 4  7  U  04  *  <i5> 

where  the  supremum  is  taken  for  all  SDF’s  f  satisfying 

Using  the  so-called  water  filling  method  (cf.  [5]. [6])  we  can  determine  T(P). 


Lenrna  3’  Let  A  =  (X:  f^X)  <  ( 1 -»-0) f w( X > >  . 

! .  If  p  ^  57  J'a  1  +  0  "  f^x)]dx*  then 
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p(p)  =  £  v°* 


(1+6)7 V) 

Z 


dX  + 


1  r  ffZ 


2(TTey  +  4*0+0)  JA  [f^X)  “  ( 1+e)  JdX.  (16a) 


n.  If  P^fA 


1  +  9  -  -=H\) 

lw 


dX,  then 


r(P)  =  £  J-A  log 


1 


f 


(  1+0)A(P)tt-(X) 

*  o 


dX 


(L6b) 


where  A^  =  (X:  f^fX)  £  ( l+9)A(P)f^(X) }  and  A(P)  $  1  is  uniquely  defined  by 


P  =  —  / 
2ir  JA 


(  1+8}A(P)  -  "t~( X) 
W 


dX. 


(17) 


In  Part  I.  T(P)  2  P/[2(l+9)].  In  Part  II,  T(P)  ^  P/[2( 1+9)A(P) ]  >  P/2(l+0) 


Theorem  1 :  (a)  Assume  that 


J  [l  +  9  -  ^(X)] 

JAl  W  J 


dX  <  ®. 


If 


and 


vp> c  Vp) 


(all, V)  €  for  sufficiently  small  a  >  0 


(19) 

(20) 
(21) 


are  full  tiled,  where  all  =  (aU(t)},  then  all  capacities  per  second  coincide  and 
are  equal  to  T(P): 

Sr(P;y)  =  S(P:S)  =  VP)  *  S(p)  =  r(p>-  (22) 


(b)  Assume  that 


Li 


i  +  e  -  rHA) 
AL  *W 


dX  <  °°. 


If  (20)  is  satisfied  and 

(aU^£^,V)  €  A  for  sufficiently  small  e  >  0  and  a  >  0, 
then  (22)  holds. 

Theorem  1  is  derived  from  the  following  theorem. 


(23) 


(24) 
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Theorem  2-  Following  (C.l)  -  (C.5‘)  holds, 

(C.l)  r(P)<^(P;^). 

(C.2)  If  (20)  is  satisfied  then 

Cyi?;#)  i  Cy(? ■.*). 

(C.3)  C^P ;3t)  ^  ^(P+fc),  e  >  0. 

(C.3‘)  If  C^( P)  is  continuous  in  P,  then 

C^P:*)  *  C^P). 

(C.4)  C^P)  *  CyCP). 

(C.5)  Under  condition  (19).  if  (21)  is  satisfied,  then 

Cy(P)=r(P).  (25) 

(C.5‘)  Under  condition  (23).  if  (24)  is  satisfied,  then  (25)  holds. 

As  one  can  see.  (20)  and  (21)  are  key  conditions  for  our  assertion  (22). 
Note  that  condition  (20)  is  equivalent  to 

1  T  2  W 

li*  eK''w.t]  *  L  f^Al  ^  x  6  Vp>  ■  W 

If  (8)  is  true  in  general,  then  (21)  holds.  Although  we  do  not  have  a  proof, 
it  is  expected  that  (26)  and  (8)  (or  (20)  and  (21))  can  be  shown,  so  that  (22) 
can  be  true  under  rather  moderate  conditions.  We  can  give  some  sufficient 
conditions  for  (20)  and  (21). 

Theorem  3:  I.  Suppose  that  (19)  is  satisfied. 

(a)  If  f^  €  F,  then  (20)  is  true. 
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f  f 

*  W  1  ^ 

(b)  In  addition  to  €  IF.  let  —  be  continuous  and  (X)  -  (lt-9)  = 

0(X  as  X  -»  ».  Then  (21)  is  true  and  (22)  holds. 

II.  If  and  f^  are  rational,  then  (19)  is  satisfied  and  (22)  holds 

Moreover,  related  to  (20),  the  following  theorem  may  be  of  interest. 

Theorem  4:  Every  stationary  process  in  &^(P)  belongs  to  if^(P). 


IV.  DISCUSSION;  RESULTS  FOR  RATIONAL  SPECTRAL  DENSITIES 


In  this  section,  we  compare  the  results  of  Section  III  with  those  given 

in  Section  8.5  of  [6].  In  order  to  clarify  the  comparison,  we  shall  assume 

that  f^  and  f^  are  rational.  Equations  (19).  (20),  (21),  and  (22)  then  hold, 

according  to  Theorem  3,  and  all  capacities  are  equal  to  T(P)  as  given  by  (16). 

The  model  used  in  [6]  is  as  follows.  The  initial  message  process 

satisf ies  E/J^(t)dt  i  PT  for  every  T  >  0.  The  transmitted  signal,  X,  is 

obtained  by  passing  X^  through  a  linear  filter  with  transfer  function  H.  The 

2 

assumption  is  that  |H(X)  l^/f^  <  Cq  <  «®,  all  X  €  IR,  and  J"^  j  dX  <  09 ■  The 

capacity  is  defined  as  lim  Gj,(P)/T,  where  Gj.(P)  is  the  capacity  for  the 

T-*» 

observation  interval  [0,T]. 

To  put  this  model  into  our  context,  we  first  note  that  f^(X)  =  | H( X ) |  , 

X  €  IR.  Since  all  capacities  are  equal  when  f^  and  f^  are  rational,  we  can 

assume  that  is  wide-sense  stationary,  with  spectral  density  f^,  so  that 

fx(X)  =  f  X(A)  |H(X)  |2  for  X  €  IR. 

2 

The  assumption  that  |H|  /f^  is  bounded  and  integrable  implies  that  the 
RKHS  of  f^  is  a  proper  subset  of  the  RKHS  of  f^.  That  is,  y  in  r°  °°)  belongs 

to  the  RKHS  of  fw  if  and  only  if  J*  »-  dX  <  “>,  where  y  is  the  L„  Fourier 

W  -*  f^(X)  J  2 
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transform  of  y.  Thus.  f^X  i  V^^f^l)^-  so 

that  y  belongs  to  the  RKHS  of  f^.  The  converse  inclusion  cannot  hold;  the 
function  h  with  Fourier  transform  H  belongs  to  the  RKHS  of  Z,  but  not  to  that 
of  W. 

One  of  our  assumptions  is  that  -1  <  0  <  °°.  where  1  +  0  is  the  infimum 
over  all  positive  constants  a  satisfying 


Jl[a  -  f^xl}] 


dX  =  <*>. 


If  f^f^  integrable  and  bounded,  then  f^/f^  “  as  |x|  -*  ®,  and  so  0  =  <*>. 

0  =  »  is  equivalent  to  the  RKHS  of  W  being  strictly  contained  in  that  of  Z.  If 

^Z^W  *s  ^nte8ra^^e'  ^en  0  =  -1;  this  is  equivalent  to  the  RKHS  of  Z  being 

strictly  contained  the  RKHS  of  W.  In  fact,  -1  <  0  <  “  is  equivalent  to 

equality  of  the  RKHS  of  W  to  that  of  Z;  we  omit  the  proof. 

If  the  RKHS  of  W  is  not  contained  in  that  of  Z,  then  Gp(P)  =  00  for  every 

T  >  0.  by  the  results  of  [3].  Thus,  the  results  of  this  paper  and  those  of 

Section  8.5  of  [6]  together  exhaust  all  cases  for  which  the  capacity  is 

finite,  and  the  problems  treated  do  not  overlap,  when  f^  and  f^  are  rational. 

From  an  operational  viewpoint,  the  model  used  in  this  paper  may  be 

preferable  to  that  used  in  [6].  That  is,  one  might  expect  to  attempt  to  limit 

the  transmitted  signal  to  some  part  of  the  noise  band,  defined  by  a  linear 

filter  with  transfer  function  H^.  This  would  be  done  by  filtering  the  received 

2 

waveform.  The  effective  noise  then  has  spectral  density  f^lH^I  .  The  trans- 

2 

mitted  signal  has  effective  spectral  density  fjjH^|  .  In  order  to  provide  the 
maximum  amount  of  information,  one  would  then  wish  to  have  the  spectral  den¬ 
sity  f^  be  such  that  the  signal  sample  paths,  which  are  from  the  process  with 

2 

spectral  density  f^|H|  •  use  as  much  of  the  available  bandwidth  as  possible. 

2  2 
In  essence,  this  means  that  the  RKHS  of  fY|H|  should  equal  that  of  f _ { H |  . 
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This  would  require  that  and  f^  have  the  same  RKHS.  This  cannot  be  achieved 
with  the  model  of  [6],  whereas  it  is  included  in  the  model  used  here.  It  earn 
be  seen  that  the  model  of  [6]  limits  the  received  signal  to  a  relatively  smal 
part  of  the  frequency  region  occupied  by  the  effective  noise  process. 

For  example,  if  the  effective  noise  at  the  receiver  is  described  by  a 
rational  spectral  density  f^  such  that  f^(X)  =  1/a1*  for  |\|  -»  ®,  then  the 
model  of  [6]  requires  that  f ^(X)  =  1/X*^*  as  |x|  -*  00 .  The  model  used  here 
permits  f^(X)  —  1/X^**  as  jX (  -»  In  terms  of  signal  sample  paths,  the  sample 
paths  of  the  signal  process  under  the  model  of  [6]  would  be  required  to  be 
4p-dif ferentiable,  while  under  the  present  assumptions,  the  signal  sample 
paths  need  only  be  2p-dif ferentiable. 

Thus,  when  f^  and  f^  are  rational,  we  conclude  the  following: 

(1)  The  problem  treated  in  Section  8.5  of  [6]  and  the  problem  treated  here 
exhaust  all  situations  where  the  capacity  is  finite. 

(2)  The  problem  treated  here  does  not  overlap  with  the  problem  treated  in 

[6]. 

(3)  The  model  treated  here  appears  to  have  some  advantages  over  the  model  of 
[6],  from  an  operational  viewpoint. 

We  now  summarize  our  main  results  for  the  case  where  f^  and  f^  are 
rational,  and  include  the  main  result  given  in  [6]  and  the  case  0  =  -1  in 
order  to  cover  all  possible  solutions. 

Theorem  5.  Suppose  that  f^  and  f^  are  rational.  Then  the  capacity  with  the 
input  process  required  to  be  stationary  is  the  same  as  the  capacity  without 
this  restriction:  C^(P ; bf)  =  C^(P;9l).  For  -1  i  0  i  ®,  the  value  of  the 
capacity  is  as  follows. 
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1. 


T 

P;S)  =  ®,  Cyy(P)  =  00  for  every  T  >  °°,  and  this 
T 

holds  with  the  additional  assumption  that  Xq  Is  a  one-dimensional 
process . 


9  =  -1.  In  this  case  C^( 


2.  -1  <  0  <  ®.  Let  A  =  {X:  fz(X)  <  (l+9)fw(X)}. 

I.  If  P  *  2w  J*A  1  +  0  "  'T<X)]dX-  then 

r  £ 

<yr:*>  =  57  ^logfo^Mjdx  *  /a[^m  -  (i+e)]dx. 


II.  If  P  *  ^  JAJl  +  0  -  -p<X)]dX,  then 

VP:90  =  4^Xa  log[(l+0)A(P)^]dX  where 
£  Z 


Aj  =  {X:  f^X)  £  ( l-«^) A( P) F^(X) }  and  A(P)  £  1  is  uniquely  defined  by 

P  =  ^  XA^log  £(  1+0)A(P)  -  ^(X)  dX. 

In  Part  I.  C^P.-S)  2  P/[2(l+0)].  In  Part  II. 

C^P-.St)  l  P/[2(  1+0)A(P)]  2  P/2(l+0). 

3.  0  =  ®.  Then  [6] 

S(P:2<)  =  47XA  log[B(P)^(X)]dX. 

2  Z 


where  Ag  =  {X:  f^fX)  £  B(P) f^(X) }  and  B(P)  is  uniquely  determined  by 


p  =  5Tja,[b<p> 


Moreover,  in  part  II  of  2)  and  in  3),  capacity  can  be  attained  with  a 
stationary  Gaussian  signal  process  X  with  spectral  density  f^.  For  Part  II  of 
2).  f^  is  defined  by 

fx(X)  =  ( 1+0}A(P)  -  [fz/fw](X)  X  in  Aj 

=  0  otherwise. 


LISS  39-9GC  -  10/9/89  -  15 


For  3),  f^  is  defined  by 

fx(A)  =  B(P)  -  [fz/ff](X)  A  in  Ag 

=  0  otherwise. 

In  considering  Theorem  5.  it  can  be  seen  that  II  of  Part  2)  is  quite 
similar  to  the  Holsinger-Gai lager  result  given  in  Part  3);  in  fact.  Part  3) 
can  be  formally  obtained  by  substituting  B(P)  =  (1+0)A(P)  in  II.  To  gain 
insight  into  this,  one  can  compare  these  results  with  those  contained  in 
Theorem  3  and  Corollary  4  of  [3].  Specifically,  Part  I  above  should  be 
compared  with  Part  (a)  of  Theorem  3  in  [3];  II  of  Part  2)  above  should  be 
compared  with  Part  (b)  of  Theorem  3  in  [3];  and  Part  3)  should  be  compared 
with  Corollary  4  of  [3], 

The  question  of  attaining  capacity  (in  particular,  by  a  stationary 

T 

process)  in  I  of  Part  2)  is  still  open.  For  any  finite  T,  C^(PT)  cannot  be 
attained  when  the  conditions  of  Part  I  are  satisfied  (see  Theorem  3(e)  of 
[3])- 

V.  LEMMAS 

In  this  section  we  shall  give  several  lemmas  needed  to  prove  our  main 

results.  Proofs  of  the  lemmas  will  be  given  in  the  Appendix. 

00  00 

Hereafter,  for  brevity,  2^  signifies  2^_j  and  J  signifies  f  m  unless 
noted  otherwise. 

We  define  an  operator  Slj,  on  LgCO.T]  by 

“  0  ^/T^.T^T-  l’  ^ 2 7) 

where  I  is  the  identity  operator. 

Lemma  4-  Suppose  that  the  condition  (21)  is  satisfied.  Then  &j.  is  a  negative 
definite  trace  class  operator  satisfying 
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(28) 


The  eigenvalues  An(T),  n  =  1,2 .  of  S^.  are  bounded  by 

-1  +  aQ  *  XJT)  *0.  n  =  1.2.....  (29) 

where  Is  the  constant  of  (11). 

The  capacity  of  a  mismatched  Gaussian  channel  is  given  in  [3].  That 
result  can  be  directly  applied  for  our  SGC.  We  arrange  the  eigenvalues  of  S^, 
in  increasing  order:  A^(T)  S  A^fT)  S 

Lemma  5  ([2], [3]):  Suppose  that  (19)  and  (21)  are  satisifed.  Then  C^(P)  are 
given  by 

C>)  .  y  Vog  [(HXn(T))-1]  -  le  *  If  Vn<T>- 

if  PT  ^  -0  2nAn(T).  (30a) 

l  K  rl^jA^T)  +  0-1PT  +  K-. 

=  W  n=ll0g  '  K(l+Xn(T))  ;  • 

if  PT  <  -e  5  X  (T).  (30b) 

where  K  =  K(T)  =  K(T,P)  is  the  largest  integer  such  that 

K  -1 

Z^T)  +  0  l7T  l  KA^T).  (31) 

Remark :  We  note  that  if  the  SGC  is  a  matched  one  then  S^.  =  0  and  (30)  is 
reduced  to 

cJ(P)  =  P/(20) .  P  >  0.  (32) 

To  examine  the  asymptotic  behavior  of  the  eigenvalues  (A^fT))  as  T  -»  », 
we  prepare  a  lemma.  Let  (to(T))  (T  >  0)  be  a  summable  sequence  and  t(A), 


cJ(P) 
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X  €  IR,  be  an  integrable  function  such  that 

0  *  rn(T)  i  M.  0  *  t(X)  £  M. 

for  all  n  =  1.2,...,  T  >  0  and  X  €  R,  where  M  is  a  constant.  Let  F  be  a 
bounded  piecewise  continuous  function  defined  on  [0,M]  such  that  F  is 
continuous  at  x  =  0  and  F(0)  =  0. 


Lemma  6:  (a)  Suppose  that  {t  (T)}  and  r  satisfy 


n' 


lim  1  2nlog  (l+orn(T))  =  ^  J  log  (l+crr(X))dX 


T-*» 

for  every  0  <  a  £  a q,  where  a ^  is  a  constant.  Then  it  holds  that 


In  particular. 


Th® 

(b)  Suppose  that 


F(t„(T))  =5iJF(T(M)dA. 

£  r  WT>k  *  s? 1 T<x>k<ix-  k  -  . 


lim  log(  1  +  crrn(T))  l  ^  J  log(l  +  crr(X))dX 
T 


for  every  0  <  a  £  a^.  Then 


lim  f  5n  rn(T)k  *  gL  J  r(X)kdX,  k  =  1,2 . 


Lemma  7:  For  any  X  €  9^(P),  it  holds  that 


_-L 


or  equivalently 


Trace  RV*TRXTRy*T  $  Trace 


E[llxJlly  T]  $  E[llxJllJ>T], 


(33) 


(34) 


(35) 


(36) 


(37) 
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_2 

Lemma  8:  Let  f  be  a  continuous  SDF  such  that  f(X)  =  0(X  )  as  X  -»  <*».  Then  for 

any  e  >  0  there  exists  a  rational  SDF  g(X)  such  that  f(X)  <  g(X) ,  X  €  IR.  and 


[g(X)  -  f(X)]dX  <  e. 


VI.  PROOF  OF  THEOREMS 

At  first  we  prove  Theorem  2. 

Proof  of  fC.ll  -  (C.41:  (C.l)  follows  from  (a)  of  Lemma  2.  (C.2)  is  clear  from 
the  definition  of  the  capacities.  Let  X  €  Vp>-  Then 

f  E[llxJll^T]  i?  +  e 

for  sufficiently  large  T.  Therefore 

17^  y  xt(x,Y)  *  cj(p+e) 

T-*° 

T  T 

and  we  obtain  (C.3).  It  is  clear  from  Lenina  7  that  9t^(P)  C  &y(P),  yielding 
(C.4).  (C.3‘)  is  clear  from  (C.3). 

Proof  of  (C.5):  For  each  a  >  0  we  define  a  process  =  (Q^a^ ( t ) }  by 

Q^(t)  =^U(t)  +  V(t).  Then  by  Lemma  1  and  (28),  we  have 

IT(U.Q(a))  =  t  2nlog(l  -  aXn(T)).  (38) 

where  X^fT),  n  =  1,2,...,  are  the  eigenvalues  of  the  operator  S^.  defined  by 
(27).  Hence,  noting  (12)  and  (13),  the  assumption  (21)  implies  that 

li£  ^  2n  log(l  -  oXn(T))  =  ^  J  log(l  -  oo_(X))dX  (39) 

for  sufficiently  small  a.  We  can  apply  Lemma  6.  replacing  rn(T)  and  r(X)  by 
-Xn(T)  and  -<7_(X),  respectively.  At  first  we  shall  show  (25)  for 
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-9 


P  £  Tjjp  J"a_(X)dX.  It  follows  from  (35)  that 


lim  *4  2 
Th» 


n 


X  (T) 
nv  ' 


li 

2t 


J  o_(X)dX 


1 


P. 


(40) 


Noting  (11)  and  (29)  we  can  apply  (34)  for  F(x)  =  log(l-x)  to  obtain 


|im  f  2n  iogf1  +  *n(T))  =  2r  J  losO  +<7_(X))dA. 


(41) 


Since  Cy(P)  is  given  by  (30).  using  (40)  and  (41).  we  obtain 


Vp>  *  <>>  *  »£  r  <*  \  lorfo+v1))'1]  ♦  S  *  4  WT» 


T-*»  ’  T-#» 

log[(l+a_(X))-1]dX  +  ^|  +  J-  J  o_(X)dX  =  T(P) 


Secondarily  we  shall  show  (25)  for  P  <  ^  /  o_(X)dA.  It  will  be  shown  that 


lira  =  A(P).  (42) 


where  K  =  K(T)  is  the  constant  in  (30b)  and  A(P)  is  the  constant  given  by 
(17).  To  prove  (42)  we  put 

A  =  Vt)(T)*  "  =  Vo(T)* 

There  exists  (T  }  such  that  T  -*  ®  as  n  -»  •  and 
n  n 

Hm  XK(J  j(Tn)  =  A. 

m"  '  n7 


Applying  (34)  for  F(x)  =  lj. _^0^(x)*x  (lg(x)  is  the  indicator  function  of  a 
set  B)  we  obtain 


1  K(V  1 

lim  =—  2  n  X(T  )  lira  =A 
n-*°  n  k=l  n-f»  n 


2  -  w 

k;W*A 


=  57  f  a  (X)dX. 
JD(A)  " 


(43) 
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where  D(A)  =  {X;  o_(\)  i  A} .  By  definition  we  have 


FT 


K(T)+1 

l  \(T)  +  r  <  (K(T)+1)Xk(t)+1(T). 


From  (31),  (43)  and  (44).  we  obtain 

1  1  K(Tn>  P 

“I  -  K<Tn>*K(T  )<Tn>  =  r  *  W  +  B 
nr»  n  n'  n-*»  n  k=l 

=  o7  f  -  oJ\)6X  +  £  - 

^  JD(A)  0 

On  the  other  hand,  applying  (34)  for  F(x)  =  lr  -j-  -.(x)  we  get 

L  A.oJ 

lim  K(Tn)XK  (Tn)  =  A  lim  ^  #{k;  X.(T  )  £  A} 
n-*»  n  v  n'  ih»  n 


(44) 


(45) 


(46) 


=  27  A|D(A)|. 


where  #B  denotes  the  cardinal  number  of  a  set  B  and  |d|  denotes  the  Lebesgue 
measure  of  a  set  D.  By  (45)  and  (46)  we  know  that  A  is  a  solution  of  (17). 


Since  (17)  has  the  unique  solution  A(P),  A  is  equal  to  A(P).  In  the  same  way 
it  is  shown  that  A  is  also  equal  to  A(P).  Thus  we  get  (42).  In  the  same  manner 
as  above,  applying  (34),  we  obtain  the  following  equations. 


limSp..  limi#{k;  X^T)  $  A(P)}  =  £  |D(A(P))|. 


1  K<T>  2  1 

lim  i  I  log(l+X  (T))  =  lim  Z  ^  log(l+X  (T)) 

T-*»  n=l  n  T-»o  Xn(T)$A(P)‘  n 

=  57  |  log(l+o  (X))dX, 

JD(A(P)) 

1  K(T)  1  1  r 

lim  f  2  X,  (T)  =  lim  f  2  X,  (T)  =  5^  o  (X)dX. 

T-*»  k=l  K  T-«  \(T)^A(P)  K  JD(A(P)) 


(47) 


(48) 


(49) 
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It  follows  from  (30),  (47)  -  (49)  and  (17)  that 


Cy(P)  =  Hi!,  Cy(P) 

Th* 


=  lim  ^  2  Iog(l+Xn(T))  +  1  lim  logfl  +  T  \(T)  +  |> 

L  '  1  k=l 


2T 

Tho  n=l 


4w 


T-*» 


K(T) 


=  ~  [  Iog(  I-kt  (X)  )dX 

JD(A(P)) 


D(A(P) ) 


a_(X)dX  +  2^} 


Proof  of  (C.5')-’  Let  tS^  =  (Z^£^(t)}  (t  )  0)  be  a  stationary  Gaussian  process 
with  f^(X)  =  9fy(X)  (  1+ct^£^(X)  )  as  the  SDF  and  consider  an  SGC 

Y(fc)(t)  =  X(t)  +  Z(£)(t),  t  €  1R.  (50) 

From  (9)  it  is  clear  that  -/a^£^(X)dX  <  ®.  Therefore,  we  can  apply  (C.5)  to 
get  the  capacity 


C<£)(P)  4  lim  i  sup  (It(X,Y(£));  xj  €  *J(P)} 

T-*» 

of  SGC  (50).  Noting  (23)  it  can  be  shown  for  every  fixed  P  that 
r<e) 


(51) 


Cy  *(P)  =  r(P)  for  sufficiently  small  e  >  0. 


(52) 


Since 


f{£)(X)  *  fz(X), 
using  Lemma  1  we  can  show  that 
IT(X.Y{£))  $  It(X.Y) 


so  that 
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(53) 


c<£)(p)  $  cyp). 

On  the  other  hand  it  can  be  shown  that 

lim  It(X,Y(£))  =  It(X.Y). 
e-O 

And  consequently,  for  every  5  >  0,  we  can  show  that  there  exists  >  0  such 
that 

Cy(P)  ^  C<£)(P)  +6.  0  <  £  i  £0.  (54) 

The  desired  equation  (25)  follows  from  (52)  -  (54). 

Proof  of  Theorem  1 :  (a)  It  follows  from  (C.l)  -  (C.5)  that 

r(P)  i  Cw( P ; y)  $  C^P ;S)  1  (^(P+fe)  (55) 

S  CyfP+fe)  =  r(P+e).  t  >  0. 

Since  T(P)  is  continuous  in  P,  we  obtain  (22)  from  (55). 

(b)  can  be  shown  in  the  same  way. 

Proof  of  Theorem  3:  I.  (a)  Let  X  €  By  definition  t(\)  =  f^(X)/f^(A)  is 

bounded  (0  £  r(A)  £  M  <  00 )  and  Jr(X)d A  £  2wP.  Denote  by  rn(T) .  n  =  1,2, ... , 

_x  _x 

the  eigenvalues  of  pR^2p.  Let  f  =  (f(t)}  and  f  =  (C(t)}  be  mutually 

independent  Gaussian  stationary  processes,  which  are  independent  of  X  and  W. 

with  SDF's  f_  =  aiwT  and  f_  =  f  (1-crr),  where  0  <  a  <  1/M.  And  let  f  be  the 
f  W  f  W'  q 

SDF  of  the  output  process  q  =  (q(t)}  of  an  SGC 

q(t)  =  f(t)  +  C(t).  t  €  F. 

Then  it  is  clear  that 

f  *  f_  +  f_  -  f_  €  F. 

V  f  C  W 

Therefore,  by  (b)  of  Lemma  2.  we  h»ve 

*«•">  -  gj  ('  *  -PffixjK  f56' 
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We  consider  another  SGC 


0(0  =  f(t)  +  w(t). 


Since  f^.(X)  i  f^(X)  for  1  X  €  IR .  using  Lemma  1  we  can  show  that 


IT(f.n)  >  IT(f.0)  =  2  2nlog(l  +  arn(T)). 
Using  (a)  of  Lemma  2.  (56)  and  (57)  we  obtain 


_1_ 

4m 


log( l+ar(X) )dX  £  llm  2  log( 1+ot  (T ) ) 

T_)a>  n  n 


(57) 


i  ,ira  zf  2nlog(l+aTn(T))  i  I(?.h) 

T-*» 


=  47  W1  *  iSrsbK 

Since  a  >  0  is  arbitrary,  in  the  same  way  as  Lemma  6,  we  can  derive 

Urn  i  Inr„(T)  S  57  J  T<X)^ 


(58) 


(59) 


from  (58).  Noting  (4)  we  know  that  (59)  is  equivalent  to  (26).  and  to  (20). 

(b)  By  Lemma  8.  for  any  e  >  0,  there  exists  a  rational  SDF  g(X)  such  that 
0  £  -a  <  g(X) ,  X  €  IR.  and 

|  [g(M  +  a_(X)]dX  <  e.  ( )  ) 

J  —a> 

»V  A, 

Let  U  =  {U( t ) }  be  a  Gaussian  stationary  process  with  SDF  f^(X)g(X)  and 
independent  of  the  process  V.  Let  a  >  0  be  fixed  and  define  processes  = 

{W  (t)>  and  W  =  (W  ft)}  by 


a '  "  '  a' 


W  (t)  =  aU( t)  +  V(t)  and  Wfl(t)  =  aU(t)  +  V(t). 


Since  g  is  rational,  the  SDF  f  *g  of  U  belongs  to  F.  Hence  (aU.V)  €  A  and 


W'V  *  17  Jj4  *  -  47  Jj4  *  ^ 


dX. 


(2) 


Noting 
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f jj( X)  =  -fw(X)(l+a+(X))a_(X)  =  -fw(X)a_(X)  *  fw(X)g(X)  =  L(\), 

we  can  show  that 

IT(U,Wa)  $  IT(U.ia).  T  >  0. 

Using  Lemma  2  we  have 


(3) 


^5  T  ¥“■'«>  >  V  Jl '°4 *  Trl"  -  3T  LIog 


Ti¬ 
lt  follows  from  (1)  that 


2 

a  a 


1+CT 


dX.  (4) 


,  a  g  .  a  a_ 

4?  I  lo*  1  *  uT  &  -  5T  J  J°*  1  -  VST. 

^  — <X>  +  *4  v  — <» 


dX 


-^.M1  * 


a2(g+a_) 


2 

l+or+-a  a 


dX 


*  £  f  log[l  +  a2(g+<7_)]dX  i  £  J”  (g+<7_)dX  <  2JL.  (5) 


Using  (2)  -  (5)  we  have 


I(U.Wa)  *  lim  I  IT(U.Wa)  ^  lim  i  IT(U.Wa)  >  I(U.Wa) 
T-*»  T-*° 


2 

a  e 
4tt 


(6) 


Since  t  >  0  is  arbitrary,  (6)  means  that 

T(U-v =  lf  v°-v  -  5-  Jl 1 M1  *  -ttK 


Thoo 


implying  (aU.V)  €  A. 

II.  If  f^  and  f^  are  rational,  then  a(X),  ct+(X)  and  a_{X)  are 


continuous.  Noting  that  0  <  0  =  lim^gf^XJ/f^X)  <  ®.  we  can  easily  see  that 

_2 

a(X)  =  0(X  )  as  X  -*  ®.  Consequen 

(b)  are  satisfied  and  (21)  holds. 


-2  -2 
ct(X)  =  0(X  )  as  X  -*  «.  Consequently,  cr_(X)  =  0(X  ).  Thus  the  assumptions  of 
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Proof  of  Theorem  4:  Let  X  be  an  arbitrary  stationary  process  belonging  to 
9t^(P),  and  define  t  and  {Tn(T)}  as  in  the  proof  of  Theorem  3.  Since  the  first 
inequality  nf  (58)  is  valid  for  all  sufficiently  small  a  >  0.  using  Lemma  6  we 
obtain 


d  J  T  V„<T>  * p 


(60) 


This  means  that  X  €  y^(P). 


APPENDIX 

Proof  of  Lemma  3:  Let  us  introduce  some  notations.  Let  S(P)  be  the  class  of 
all  SDF’s  f(X)  which  satisfy 

J  f(X)dX  £  2wP/0 

and  J(f)  be  a  functional  defined  by 

J‘f>  =  5?J  >4  ♦i£&rK 

Then,  noting  (10)  and  (15),  we  have 

T(P)  =  sup  (J(f);  f  €  S(P)}.  (61) 

We  shall  use  the  following  inequalities, 

log(l+x)  £  x.  x  >  -1,  (62) 

*4  ♦£]>?<><«**<»•  <63> 

Suppose  first  that  P  £  ^  /  a_(X)dX.  Denote  by  T(P)  the  RHS  of  (16a).  If 
f  €  S(P),  then  from  (62) 
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dX 


*rj(f)  =  J[log[T^T]  - 

=  J[1os[tSJm]  *  log['  *  <’-(M  * 

S  f1°g[l+g1(X)]dX  *  |a_(x)dx  ♦  Jf(X)dX 

$  Jlog  [^g^x'JdA  +  Jcr_(X)dX  +  2vP/0  =  47r?(P). 

This  inequality  means  that 

r(P)sr(P).  (64) 

From  (9)  and  (10)  we  know  that 

J[(l+6)  -  min  {(1+5),  (l+o(X))}]dX  =  ®. 

for  every  6  >  0.  Hence,  for  every  fixed  6  >  0,  there  exists  a  function  u(X) 
such  that  0  £  u(X)  £  6,  u(X)  =  0  If  cr(X)  £  5,  and 


Ju(X)dX  =  +  Jcr_(X)dX. 


(65) 


Define  an  SDF  f  by  f(X)  =  u(X)  -  a_(X).  Then  f  €  S(P).  Noting 
u(X)/(1+ct+(X) )  £  u(X)/(l+6)  and  using  (63)  we  obtain 


JIlog 

1 

+  log 

Cl  ,  u(X) 

[l+a_(X)J 

L  i+o+(x) 

JH 

[  1 

+  log 

fl  +  u(x)ll 

l1+ct_(X)J 

[l  *  trill 

[  1 

1  .  uIM 

^1+CT_(X)J 

1  1  + 

25  J 

(66) 


It  follows  from  (61),  (65)  and  (66)  that 

s  J rerrxF*  *  rh  (¥*  p-oH- 


(67) 
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Since  5  >  0  is  arbitrary,  (67)  gives 


r( P)  i  r(P). 


(68) 


The  equation  (16a)  follows  from  (64)  and  (68).  Next,  we  assume  that  P  < 

-  2~-  Jo_(X)dX.  Define  sets  B_  =  B_(P)  and  B+  =  B+(P)  in  IR  by  B_  =  {X; 

ct(\)  i  A(P)}  and  B+  =  (X;  ct(X)  >  A(P)}.  Note  that  o(X)  =  a  (X)  for  X  €  B  By 

(17)  it  is  clear  that  a  SDF  f^.  defined  by 

A(P)  -  cr_(X).  X  €  B_, 


belongs  to 


CL 

. 

S(P).  For  any  f  in 


X  €  B+, 


S(P). 


Jf  *  JfQ  =  2irP/6 

ind  we  wish  to  show 


that 

M1  *  1^]  i  Jlog[‘  * 
M^l  ‘  «• 


.  r  f-fc 


(69) 


(70) 


so  that  4wJ(f)  <;  4wJ(f0).  It  follows  from  (61)  and  (70)  that  T(P)  =  J(fQ), 
yielding  (16b). 


Proof  of  Lemma  4:  The  relation  (28)  is  clear  from  (27)  and  (14),  and  implies 
that  is  negative  definite.  The  assumption  (21)  means  that  1^(11, Q^)  <  00 
where  '(t)  =  aU(t)  +  V(t).  Thus  S^.  is  a  trace  class  operator  by  Lemma  1. 
Noting  (11)  -  (13)  we  know  that 
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vx> 

tyX-J  =  i  1  -  aQ,  on  {fw  *  0).  (71) 

From  (71),  Ry  T  1  (l-a^Ry  r  so  R^-pRy .-fRy/T  *  C1-^)1*  or 

Sy  *  (-1  +  aQ)I,  (72) 

where  I  Is  the  Identity  operator.  The  relation  (29)  follows  from  (72). 

Proof  of  Lemma  6-'  (a)  At  first  we  shall  prove  (35)  by  induction.  We  use  the 
following  inequality, 

K-l  k-1  k  K 

|log(l+x)  -  2  ^  -  |  i  .  0  i  x  $  1.  K  =  2.3 .  (73) 

k=l  K  * 

From  (33)  we  know  that  (T  ^JI^logfl+ar^T})  is  bounded  for  sufficiently  large 
T.  Hence  (T  *)2nTn(T)  and,  consequently,  (T)  are  bounded.  Therefore, 

there  exists  constants  and  T^  such  that 

T  Vn(T)2  <  Kr  TiTr  (7<) 

For  any  fixed  e  >  0  there  exists  cij  >  0  such  that 

0  i  Jr(X)dA  -  ^  Jlog(l+oT(X))dX 
1  |  Jr(A)2ciA  <  2we,  0  <  a  *  (75) 

Similarly,  we  know  that  there  exists  Og  >  0  such  that 

0  (  h  t  (T)  -  4  1  log( 1+aT  (T)) 

T  n  nv  ’  aT  n  nv  '' 

i  gf  Vn(T)2  <  fel  0  <  a  ^  a2,  T  *  Tj.  (76) 

Fix  a  number  i  min(ag,aj ,a2) .  It  follows  from  (33)  that 
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U03T)  12nlog(l+a3Tn(T))  '  a3-r(X))dA  |  <  e.  (77) 


T  i  T2. 


where  T _  is  a  constant.  From  (75)  -  (77)  we  obtain 

If  Vn(T)  "  2w  JT(X)dXl  <  36 •  T  i  max(TrT2). 

yielding  (35)  for  k  =  1.  We  now  assume  that  (35)  is  true  for  k  =  1, 
Then, 

i?lf  v»<T>K  -  4  JT<X>K<*' 

<  '4  .V-IJF-k<T  Vn<T)k  '  4  I 

, iK-1  ,  , >k-l  k  ,  ,  ,  r  , 

*  ,  (  ’k  “  4  V»‘T>  -  4  J-CX)kdA) 


. . ,  K-l 


The  second  term  on  the  RHS  of  the  inequality  converges  to  zero  as  T  ->  »,  for 
all  a  >  0.  by  the  induction  hypothesis.  The  first  term  is  majorized  by 


1  *  t-l)k~?«-  Vn(T)k  -  -V  y«(l«rn(T)) 

Ta  k=l  Ta 


1_  y  L_L1 - —  fT(X)kdX - -  flog(l+ar(X))dX 

J  2rra  J 


-  K  ■ “1  k 

2va  k=l 


+  l-TT  2  log(l+aT  (T) ) - flog( 1+ot(X) )dX 

Ta  n  n  2raK  J 


The  last  of  these  three  terms  converges  to  zero  as  T  -*  «,  for  all  a  <  a q,  by 


(33).  Using  (73),  the  first  two  are  majorized  by 
T(K+1 )  2nTn(T)K  +  2w(K+l)  Jt(X^  ^ 


<  JL  j  T  (T)2  ♦  *F-L  JL  f, 

S  K  +  1  2T  n  nl  '  K  +  1  2w  J 


2  sm*’1 

T(xrdx  i  gVj  e 
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for  a  i  min by  (75)  and  (76).  This  proves  (35).  We  turn  now  to  the 
proof  of  (34).  For  any  fixed  e  >  0.  there  exist  polynomials 


P(x) 


Q(x) 


such  that  P(x)  i  F(x)  i  Q(x),  0  £  x  <,  M,  and  J^{Q(t(x))  -  P(-r(x))}dx  <  e.  It 
is  clear  that 


lim  T  4  P(t  (T))  £  lim  T  4  F(t  (T))  £  lim  T  4  F(r  (T) ) 

T-*»  n  n  *  ^  n  nv  n  nv 

$  lim  T~4q(t  (T)). 

Th»  n  n 

Using  (35)  we  get 

lim  T_12nP(rn(T))  =  (^’^(^XJJdX  *  (Ibrf  ^(rfX) )dX  -  e. 


(78) 


.-I, 


llQ 


lim  T  2  Q(t  (T))  =  (2w)  Q(r(X))dX  £  (2w)  F(r(X))dX  +  e. 

Thbo  n  n 


(79) 

(80) 


Since  e  >  0  is  arbitrary,  (34)  follows  from  (78)  -  (80).  Finally,  (b)  can  be 
shown  in  the  same  manner  as  (a). 

Proof  of  Lemma  7:  Since  fy(X)  =  F^( A) ( l+o+(X))  2  f^(M  for  all  X  in  IR,  we  have 

-1  -1  J-  _i  x 

Ry  j  >  j  and  consequently  Ry  y  i  4  T’  Therefore,  ^Jly  < 

x  _i  x 

R^  ^R^  and  finally  we  obtain 

Trace  S^AA^T  *  Tra~  4A!At  *  Trace  4 AVI.! 


-  Trace  /T^.T^W^T 


The  inequality  (37)  follows  from  (36)  and  (4). 
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Proof  of  Lemma  8:  There  exist  A  >  0  and  a  >  0  such  that  a  rational  function 
<p(X),  defined  by 


^.(X)  =  a(A2  +  X2)  1  (A  =  3ire  1 ) , 

saM  sf  ies 


f(X)  i  WX).  M  i  A. 


Note  that 


f 

J  — o 


<#>(A)dX  =  3wA 


-1 


Using  Weierstrauss ’  theorem,  we  can  show  that  there  exists  a  rational  SDF  v//(X) 
satisfying  f(X)  £  +(A)  for  |x|  £  A,  0  <  *(X)  £  f (X)  +  e/(12A)  for  |x|  <  2A  and 
J|X|^2a^^)  £  fc/3.  define  the  function  g(X)  by  g(X)  =  <p(X)  +  '/'(X).  Then  we 
can  show  that  f(X)  £  <//(X)  £  g(X)  for  |X|  £  A,  f(X)  £  <#>{X)  £  g(X)  for  |x|  >  A 
and 

£  f  [g(X)  -  f (X)]dX 

J  op 

S  |  -  f(X)]dX  +  I  'KMdX  +  U>(X)dX  <  |  . 

J0  J2A  J0 
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